XQuake: an XML-based Knowledge Discovery Environment
نویسندگان
چکیده
Data mining is the analysis of large volumes of data to find unsuspected relationships and to summarize the data in novel ways, that are both understandable and useful to the data owner. Nowadays, the rapid growth of semi-structured sources raises the need of designing and implementing environments for data mining out of XML data. On the basis of the principles of the inductive database theory, this dissertation presents a flexible data mining system with capabilities of obtaining, maintaining, representing and querying induced, deduced and prior knowledge, stored inside native XML databases. In particular, it summarizes our three-years experience in the design and development of XQuake, a query language that extends XQuery to support mining primitives. Features of the language are an intuitive syntax, a good expressiveness, and the capability of dealing uniformly with data mining entities. A detail of its implementation and the evaluation of its performance are also given.
منابع مشابه
XQuake as a Constraint-Based Mining Language
XQuake is a language for data mining inspired by the inductive databases theory. This work extends XQuake with the definition of domain-specific constraints. An ontology is used to describe the domain knowledge. We give the main idea of the work-inprogress discussing its possibilities and advantages.
متن کاملGrid-Based Knowledge Discovery Services for High Throughput Informatics
Discovery Net is an application layer for providing gridbased knowledge discovery services. These services allow scientists to create and manage complex knowledge discovery workflows that integrate data and analysis routines provided as remote services. They also allow scientists to store, share and execute these workflows as well as publish them as new services. Discovery Net provides a higher...
متن کاملQuerying Compressed Knowledge Bases in Pervasive Computing
In the so-called Semantic Web of Things (SWoT), annotated information is tied/derived to/from micro-devices, such as RFID tags and wireless sensors, deployed in an environment. Compression techniques are so needed, due to the verbosity of semantic XML-based languages. Beyond compression ratio, query efficiency is a key aspect for knowledge discovery in mobile ad-hoc scenarios where resources ar...
متن کاملAn XML Framework Proposal for Knowledge Discovery in Databases
In recent years, the XML language has been receiving much interest among IT community. It has many nice properties that make it a great candidate for representation of different kinds of data. In this paper we will propose an XML framework for the domain of knowledge discovery in databases. This is not a specification document; this article tries to be a compendium of ideas and remarks concerni...
متن کاملData Mining and XML: Current and Future Issues
The paper describes potential synergies between data mining and XML, which include the representation of discovered data mining knowledge, knowledge discovery from XML documents, XML-based data preparation, and XML-based domain knowledge. Each category is viewed from a theoretical as well as a practical point of view.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009